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DECLARATION OF CHAITAN KHOSLA 

Assistant Commissioner for Patents 
Washington, D.C. 20231 

Dear Sir: 

I, Chaitan Khosla, declare as follows: 

1 . I am a Professor of Chemical Engineering and Chemistry at Stanford University, 
a co-founder, Board Member, and consultant of Kosan Biosciences, and a co-inventor of the 
above-referenced application. I have actively conducted research in the field of polyketide 
synthesis for over 10 years. A copy of my curriculum vitae is attached. 

2. I understand that the specification in the above-referenced case has been criticized 
as failing to provide sufficient information to show that I and my co-inventors were in possession 
of our invention and to provide sufficient information to enable others of skill in the present art 
to practice the invention as claimed. The basis for this criticism is that the sequences of only a 
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few of the existing intramolecular and intermolecular linkers which facilitate polyketide chain 
transfer are disclosed, and that these sequences do not show a high degree of homology. 
Therefore, it is reasoned that one of ordinary skill would not be able to practice the invention 
either because other linkers besides those articulated are not described in the specification, or 
because it would not be possible for the practitioner readily to find such linkers without explicit 
description of them in the specification. 

3. I believe this perception is in error and can verify that, as practitioners of the art, 
both I and my graduate students, post-doctoral students and colleagues in the field would readily 
be able to obtain from the published literature, familiar to us as practitioners, the desired linker 
structures. 

4. As an example, I believe this is made clear in Chapter 2 of the thesis submission 
of one of my graduate students, which is appended to the response to the Office action in the 
above-referenced application as Exhibit B. The chapter contains a description of work 
demonstrating the ability of intermodular linkers to facilitate the growth of a nascent polyketide 
chain. This thesis was submitted November 1, 2000, and is based on publicly available 
information regarding polyketide sequences. All sequences from the ery, rap, FK506, rif and pik 
clusters were available as the date of the application to which priority is claimed, 9 February 
1999; as were sequences of ole5-6 in Figure 2.10 and ole5 in Figure 2.12 . The remaining 
sequences were deposited in the database after that date. 

5. Of particular interest are the tables set forth on pages 14-16 of this chapter, which 
show intrapolypeptide linkers (Figure 2.10 on page 14) and interpolypeptide linkers 

(Figures 2.1 1 and 2.12 on pages 15 and 16, respectively). As indicated in those figures, the 
C-termini of the ACP catalytic regions and the N-termini of the KS regions of a wide range of 
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already sequenced modular polyketide synthases show such a high degree of homology that 
identification of the linker moieties between these modules is readily apparent. We do not 
expect there to be substantial homology among linkers, especially with regard to the 
interpolypeptide linkers for which the portion bound to the C-terminus of the upstream module is 
shown in Figure 2.11 and the portion bound to the N-terminus of the downstream KS is shown in 
Figure 2.12. 

6. As all of the sequences set forth in these figures are available in publications and 
databases of which one of ordinary skill in the relevant art would have knowledge, that person 
would readily be able to ascertain the amino acid sequence of either a intrapolypeptide linker or 
the two portions of an interpolypeptide linker based on these disclosures. 

7. I understand that it is also asserted that not all possible modular polyketide 
synthases have yet been cloned and sequenced. Undoubtedly this is true; however, because of 
the striking homology exhibited by the catalytic regions in the multiple PKS whose sequences 
are already available, there appears no doubt that any subsequently cloned and sequenced 
modular PKS will exhibit a similar level of homology and be amenable to the analysis discussed 
above. Hence, one of ordinary skill would could identify linkers from these synthases as well. 

8. For these reasons, the information requested as assertedly necessary to provide 
adequate disclosure of the invention as claimed in the above-referenced application is, to the 
extent it relates to PKS sequences known as of the priority date of the instant application and to 
PKS sequences later determined, readily available to, and readily accessible by, one of ordinary 
skill, and there would be no need to provide such skilled practitioners with any more direction 
than that set forth in the specification to permit successful practice of the invention as claimed 
with respect to such sequences. 
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1 I declare that all statements made herein of my own knowledge are true and that all 
statements made on information and belief are believed to be true; and further, that these 
statements are made with the knowledge that willful, false statements and the like so made are 
punishable by fine or imprisonment or both, under Section 1001 of Title 18 of the United States 
Code and that such willful false statements may jeopardize the validity of the application or any 
patent issued thereon. 

Executed this _^ day of August, 2002, at Stanford, California 




Chaitan Khosla 
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Camille and Henry Dreyfus New Investigator Award, 1991; National Science 
Foundation Young Investigator Award, 1994; David and Lucile Packard 
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Principal Research Interest 

Biocatalysis 

Our research interests, which straddle the interface of chemistry, biology, and engineering, revolve 
around microbial natural products. We work on three themes pertaining to natural products: 

(1) Biosynthetic mechanisms. We are interested in understanding fundamental mechanisms underlying 
the biosynthesis of natural products. Of particular interest are multi-enzyme systems (a.k.a. 
megasynthases) which catalyze the biosynthesis of complex natural products such as polyketides and 
nonribosomal peptides. These multifunctional catalysts are remarkable in their ability to chaperone 
highly reactive biosynthetic intermediates through 10-100 distinct catalytic events. Individually they are 
exquisitely selective with respect to product functionality and stereochemistry; yet as a family of 
enzymes, they generate breathtaking diversity. Examples of our laboratory*s contributions to the current 
understanding of polyketide biosynthetic mechanisms can be found in some of the references below. 

(2) Engineered biosynthesis of "unnatural" natural products. Over the past three decades breakthroughs 
in molecular genetics have propelled a revolution in our ability to analyze and manipulate the structures 
and properties of nucleic acids and proteins. Our research is premised on the notion that nature has 
evolved similar genetic codes for the biosynthesis of polyketides and nonribosomal peptides. However, 
in contrast to nucleic acids and proteins, whose structures are directly encoded within modular 
templates, the biosynthesis of these molecules lacks the involvement of overt template molecules. 
Instead, it appears that molecular diversity arises from the modularity of these megasynthases at the 
domain/subunit level. We seek to develop novel, interdisciplinary methods for exploiting this 
modularity for the practical synthesis of new natural product analogs. Again, examples of our 
accomplishments can be found in some of the references below. 

(3) Biology of natural and "unnatural" natural products. The ability to practically manipulate the 
structures of natural products offers the opportunity to understand their multi-faceted mechanisms of 
action. Of particular interest at present to our laboratory is the mechanism of action of apoptolidin, a 
newly discovered natural product that acts as a potent and selective inducer of programmed cell death 
(apoptosis). 
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Chapter 2. Qualitative Analysis of Heterologous Module Fusions 



2.1. INTRODUCTION 

In die absence of structural information or a model of intermodular interfaces, < 
investigation of chain transfer between modules presented a challenge. The simplest 
context in which transfer occurs is the truncated DEBSl-TE construct (Figure 2.1). 
However, instead of dissecting a natural bimodular protein, such as DEBS 1 , a panel of 
heteroloeous module-module fusions were consulted. By observing the successful 
fusions (as judged by polyketide production), it was anticipated that important factors for 
chain transfer could be discovered and analyzed. 



DEBSl-TE 




OH 



or 

2 



Figure 2 1 DEBSl-TE. Truncated form of DEBS. DEBSl-TE requires 
SodilSsf^ between modules 1 and 2 to yield the tnkefcde 



lactone product. 



Although active site domains within moduli and ft. loading didomain of DEBS 
havebeensuecessfcllyreplacedthroaghgeneucmanipulationanddespiteampte 

evidence of the functional independence of modules™', successful module fusions tad 
never been reported. Wore, judicious planning was needed to select modules and 
fusion sites best suited for heterologous transfer. Because it was assumed that selec«,v,ty 
would be dictated by the recipient module, the first step required finding modules that 
demonstrated tolerance towards unnahrral substrates. At the t ime of these e.tpenments, 
preliminary work had shown that individually purified modules 2, 5, and .6 of DEBS, 
each fused with the terminal thioesterase (M2-TE, M5-TE, M6-TE), were capable of 
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recognizing and elongating the A^tylcysteamine (NAC)-diketide thioester 3 (NDK) to 
produce -the triketide lactone 2 (Figure 2.2A) 21 . Similarly, module 3 fused to the 
thioesterase (M3-TE) extendedNDK to the unreduced triketide lcetolactone 4 (Figure 
2 2B) Since NDK mimics the natural substrate of only M2-TE, modules 3, 5, and 6 each 
exhibited relaxed substrate specificity by accepting NDK. These in vitro experiments 
immediately intimated a framework for studying module fusions: replacement of M2 m 
DEBS 1-TE with M3, M5, andM6 to generate LDD-M1-M3-TE, LDD-M1-M5-TE, and 
LDD-M1-M6-TE (where "LDD" represents the loading didomain), respectively. The 
concern regarding substrate specificity was tempered because the module, would be 
presented with the same NDK substrate, except it would be via module 1, ramer than v.a 
exogenous loading. 



■ o==<^ C M2-TE, M5-TE, or M6-TE V fS^ 

un_/ ~ NADPH ~~ A<A> 

MetliyimalonyUGoA I 



o^ 0 M3-TE 
H0 ^y~ - Methylmalonyl-CoA 

3 




Figure 2.2 Individual module assays. (A) Exogenous* fed NDK is 
I epted and extended by modules 2 5, and 6 to fern the de 
lactone (B)Module3 fused to the thioesterase also accepts NDK .but 
Sd catalyzes production of the unreduced triketide ketone. 

These bimodular constmcts presented attractive targets, but previous attempts at maldng 
these fusions yielded proteins unable to catalyze in vivo the biosymhesis of the expected 
triketide lactones 2 (in me case of LDD-M1 -M6-TE) 28 or 4 (in the case of LDD-M1-M3- 
TE) 29 However, the purified proteins were able to extend NDK to their respecttve 
triketide lactones, thereby demonstrating the catalytic competence of the downstream 
fused modules 30 (Figure 2.3). 
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Based upon these results, we hypothesized that in each construct, both module 1 
and the transplanted module retained their individual activities, but simply lacked the 
proper connectivity required to allow transfer between them. 

By selecting a different site for fusion, the LDD-M1-M3-TE and LDD-M1-M6- 
TE constructs were recloned in an active form, reflected by their ability to efficiently 
catalyze die biosynthesis of the expected triketides in vivo in Streptpmyces coelicolor. 
This surprising result implicated the importance of short "linker" regions located outside 
the conserved catalytic domains. Other heterologous fusions exploiting this 
understanding suggested the true "modularity" of modules, where modules may be 
replaced within a framework of linkers that maintain the overall organization. 



S-CoA 



LDD 



Module I 



Module 6 



TE 




OH 

;:cc 



iNAC 



2 



S-CoA 

i- 



LDD 



Module 1 



Module 3 



TE 




SNAC 
H0~( 




FiEure 2 3 Previous attempts of bimodular fusion. Although the 
bimodular construe were not able to produce triketides from their 
natural substrate, propionyl-CoA 5. the downstream module was able to 
process exogenous NDK to the expected triketide lactones, 



2.2. Materials and Methods 



2.2.1. Construction of Module Fusions for 5. coelicolor Expression 

I. LDD-M1-M3-TE (pST97): This bimodular fusion contains module 1 fused to 
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DEBS1-TE 

a Module 1 . . . . n TE 

A. LDD Module 2 QH 

s — 1 2 

B. 

.CCC. GTC GAC GAG CCG ATC GCC ATC GTC GGC ATG GCG. . . 

"p V D E P I A I V G M A 

M1-M2 Linker BsaBI KS of M2 



LDD-M1-M3-TE 

r Module 1 . , . TE 
C Module j 




;-CoA 

' AT^cM<sYATYl<R>CPVfKSYATYACPYTE; 





^CCC^GTC GAC GAG CCG ATC GCC ATC . GTC GGC ATG GCG . . 

" '? v D E P I A I V G M-A 

KS of M3 

CTG GAA TCC GAC CCG ATC GCC ATC GTC AGC ATG GCC - . 

' " L E S D P I A I V • S M A 

Fusion 

CCC GTC GAC GAG CCG ATC GCC ATC GTC AGC ATG GCC . . 

"*p v D E P I A I V S M A 



M1-M2 Linker 



BsaBI KSofM3 



Figure 2.4 DEBS1-TE and LDD-M1-M3-TE. (A) Schematic diagram 
showing the major polylcetide product. Curved arrows represent where 
intermodular transfer occurs. (B) Nucleotide and amino acid sequence 
at the intermodular junction. (C) Schematic for LDD-M1-M3-TE. (D) 
Nucleotide and amino acid sequences near the splicing site of DEBS 
M2 KS and M3 KS. The composite sequences are shown below the 
contributing sequences. Nucleotides and amino acids of the enzyme 
recognition site are highlighted in bold font 

module 3 by talcing advantage of the natural, conserved Bsa BI site located at the start of 
the ICS domain. Thus, the "linker" sequence bridging the fused modules is the natural 
sequence between modules 1 and 2, as in DEBS1. Module 3 fused to the thioesterase 
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domain was supplied as a Bsa BI-Eco RI fragment from a derivative of pRSG34 27 . This 
fragment was cloned behind Ml in a derivative of pCK12 20 to generate pST97 27 (Figure 
2.4 C and D). 

II. LDD-M1-M5-TE (pST142): Tliis bimodular fusion was constructed in an 
identical manner as P ST97, except it contains module 5 instead of module 3 . Thus, it 
also maintains the natural M1-M2 linker between modules 1 and 5. Module 5 fused to 
the thioesterase domain was supplied as a Bsa BI-Eco RI fragment from a derivative of 
P RSG46 27 . This fragment was cloned behind Ml in a derivative of pCK12 to generate 
pST142 (Figure 2.5 A andB). 



LDD-M1-M5-TE 

A. • LDD Module5 j ggj 




5 C^GTC GAC GAG CCG ATC GCC ATC GTC GGC ATG GCG. . . 
I-.PPTAIVGMA 



B. 

KSofM2 

.CCC G 

p V D E P I A 
KS CGG*GCC GGT GAG CCG ATC GCG ATC GTC GGC ATG GCG 

""ragepiaivg^a 



. FU CCC GTC GAC GAG CCG ATC GCG ATC GTC GGC ATG GCG, 

P V D E P I A I V G M A 

' ,„r- i BsaBI ICS of MS 

M1-M2 Linker 



Fieure 2 5 LDD-MI-M5-TE. (A) Schematic diagram showing the 
major polykctide product. Curved arrow represents where intermodular 
transfer occurs. (B) Nucleotide and amino acid sequences near the 
splicine site of DEBS M2 ICS and M5 ICS. The composite sequences 
are shown below the contributing sequences. Nucleotides ; and amino 
acids of the enzyme recognition site are highlighted in bold font. 
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Chapter 2. Qualitative Analysis of Heterologous Module Fusions 

III. LDD-M1-M6-TE (pST96): This bimodular fusion was constructed with the same 
cloning strategy described for pST97 and pST142. Module 6 with the thioesterase 
domain was supplied as a Bsa BI-Eco RI fragment from a derivative of pJRJlO 25 . This 
fragment was cloned behind Ml in a derivative of P CKU2 to generate P ST96 (Figure 2.6 
A and B) 21 . . 



LDD-M1-M6-TE 



OH 



LDD 



Module 1 



Module 6 



XX 



i-CaA 




B. 




™CC M GTC GAC GAG CCG ATC GCC ATC GTC GGC ATG GCG. 
'*p V D E P I A I V G M. A 

KS GAT M GCC GAC GAC CCG ATC GCG ATC GTC GGC ATG GCC. 

"daddpi'aivgma 

FU 4c GTC GAC GAG CCG ATC GCG ATC GTC GGC ATG GCC, 
"J V D E P I A I V G M A 



M1-M2 Linker 



Bsn BJ 



ICS of M6 



Ficure 2.6 LDD-M1-M6-TE. (A) Schematic diagram showing the 
major polyketide product. Curved arrow represents where imermodular 
transfer occurs. (B) Nucleotide and amino acid sequences near the 
splicing site of DEBS M2 KS and M6 ICS. The composite sequences 
are shown below the contributing sequences. Nucleotides and amino 
acids of the enzyme recognition site are highlighted m bold font. 

IV. LDD-Ml-rifM5-TE (pSTl 14): Module 5 of the rifamycin polyketide 
synthase 31 - 32 (rifM5) contained the conserved Bsa BI site upstream of its pMcetoacyl-acyl 
ier protein synthase (KS) domain, so the site of module-module fusion required no 
However, to attach die thioesterase domain to the C-terminus of rifM5, an 



earner 



engineering. 
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KSofM2 

. . . CCC GTC GAC GAG CCG ATC GCG ATC GTC GGC ATG GCG . . . 
PVD3PIAIVGMA 

KS of rifMS 

... GAC GCG GGC GAG CCG ATC GCC ATC GTC GGC ATG GCG . . . 
DAGEPIAIVGMA 

Fusion 

... CCC GTC GAC GAG CCG ATC GCC ATC GTC GGC ATG GCG . . . 

p y D E P I A I V G M A 



M1-M2 Linker asaJ " KS of rifMS 

C. 

ACP of rifMS 

. . . CTC GCC CGG CAC CTG CGC GAC C-AA CTC GGT GGT GCG . . . 
LARKLRDELGGA 

ACPofM6 

. . . TTG GCC GAC CAC ATA GGA CAG CAG CTC GAC AGC C-G3 . . . 
LADHIGQQLDS. G. 

Fusion 

,CTC GCC CGG CAC CTG ACT AGT CAG CTC GAC AGC GGG . . . 
" ' L A R H L T S Q L D 5 G 

ACPofrifM5 _ Spcl TE 



Figure 2.7 LDD-Ml-rifM5-TE, (A) Schematic diagram showing the 
major polylcetide product Curved arrow represents where intermodular 
transfer occurs. (B) and (C) Nucleotide and amino acid sequences near 
the splicing site ofrifM5 KS aud DEBS M2 KS. and rifM5 ACP and 
DEBS M6 ACP, respectively, The composite sequences are shown 
below the contributing sequences. Nucleotides and amino acids that 
were changed for en2yme recognition site engineering are highlighted 
in bold font. 
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Spe I site was required. The Spe I site was placed downstream from its acyl carrier 
protein (ACP) domain, by using the primers 5'-CAA CCG CAT GCG GAC CGC-3' 
and 5'-TTC ACT AGT CAG GTG CCG GGC GAG-3' (nucleotides in boldface are 
complementary to the rifamycin sequence) to amplify the region between a natural Sph I 
site and the engineered Spe I site. The location of the Spe I site was chosen by aligning 
the amino acid sequences of the ACP domains of the six modules of DEBS and rifM5; 
the Spe t was placed in die region homologous with the engineered DEBS 1-TE (pCK12) 
20 and DEBS1-M3-TE (pCK13) !fi . A three fragment ligation was performed with the 
above PCR amplicon, a Bsa Bl-Sph I fragment derived from pHu72 that contained the 
remaining upstream portion of rifM5, and a plasmid containing the thioesterase domain 
to generate pSTlOO. A Bsa BI-Eco RI fragment from pSTlOO containing rifM5-TE was 
inserted behind Ml in a derivative of pCK12 to yield pSTl 14 (Figure 2.7 A, B, and C) 

V. Ml-rifM5-DEBS2-DEBS3 (pSTl 13): The DEBS 1 portion of a derivative of 
pCK7 2fi was replaced with the homologous region from pSTlOO using the Pac I and Spe I 
restriction sites to make pSTl 13 (Figure 2.8 A and B) 11 . The resulting plasmid contains 
rifM5 substituted for module 2 of DEBS. The linker between module 1 and rifM5 is 
identical to those used in the previous constructs, but in addition, this construct also 
maintains the natural linker region between modules 2 and 3. Therefore, in place of the 
thioesterase domain attached to its C-terminus, rifM5 has the C-terminus of module 2 of 
DEBS. 



2.2.2. Isolation of Polylcetides Produced Jit vivo 

To transform S. coelicolor with the above plasmids, the requisite non-methylated DNA 
was obtained via transformation into E. eott ET12567 33 . The resultant DNA was 
transformed into S. coelicolor CH999 following established protocols 34 . Transformants 
were grown on R5 medium at 30° and selected with an overlay of thiostrepton (50 g/L in 
DMSO) in 1 mL of water (500 mg/L final concentration). Multiple transformants of each 
constrjct 
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LDD-Ml-riiM5-DEBS2-DEBS3 




B. 1 
ACPpfriiM5 

. . . CTC GCC CGG CAC CTG CGC GAC GAA CTC 'GGT GGT GCG . . . 
LARHLRDEL.GGA 



ACPofM2 

. . . GTC GCC GGT TTC CTC GAC GCC GAG CTC GGC ACC GAG . . . 
V .AGFLDAELGTE 

j Fusion 

I ... CTC GCC CGG CAC CTG ACT AGT GAG CTC GGC ACC GAG . . . 
L A R H L T S E L G T E 

ACPofrifM5 S P eI C-terminus of M2 

| 

Figure 2.8 LDD-M 1 -rifM5 -DEB S 2-DEB S3 . (A) Schematic diagram 
showing the major polykeiide product. Curved arrow represents where 
intermodular transfer occurs in first polypeptide before transfer to 
DEBS2 and DEBS3. (B) Nucleotide and amino acid sequences near 
the splicing site of rifM5 ACP and DEBS M2 ACP. The composite 
sequences are shown below the contributing sequences. Nucleotides 
and amino acids that were changed, for enzyme recognition site 
engineering are highlighted in bold font 

were re-streaked onto R5 plates supplemented with sodium propionate (500 mg/L) and 

thiostrepton (50 mg/L) and were grown into confluent lawns (7 days). The agar medium 

was then homogenized and extracted three times with warm (40°C) ethyl acetate. The 

solvent extract was dried over magnesium sulfate, concentrated, and filtered through 

silica gel (1.2 x 4 cm, eluted isocratically with 50% ethyl acetate in hexanes). Evaluation 

of the crude extracts was performed via thin layer chromatography (TLC) in 50% ethyl 

acetate in hexanes and stained with vanillin- Purification of the products was carried out 

as described below. 
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2.3. Results 

2.3.1. Production of triketide ketolactone by CH999/pST97 

S. coelicolor CH999/pST97 produced triketide Jcetolactone 4 (ca. 10 mg/L), as 
determined by 'H NMR spectroscopy (Figure 2.4 C). As mentioned earlier, this product 
was the same product detected when NDK, the NAC-thioester analog of the diketide 
intermediate produced by module 1, was incubated with purified M3-TE 27 . The 
ketolactone was purified via silica gel chromatography (1 .2 x 10 cm, eluted isocratically 
with 40% ethyl acetate in hexanes) and had an R f of 0.5 in 50% ethyl acetate in hexanes. 

2.3.2. Production of triketide lactone by CH999/pST142 

S. coelicolor CH999/pSTl42 produced triketide lactone 2 (ca. 5 mg/L) and ketolactone 4 
(ca. 2 mg/L) as determined by l H NMR spectroscopy (Figure 2.5 A). The reduced 
lactone and the ketolactone were identical to those produced in the assays of NDK with 
M5-TE and M3-TE, respectively. The products were corroborated by unit mass 
spectrometry (ESI), M+ calculated for CgHwOa and C 9 H t4 0 3 : 172 and 170. Observed: 
172 and 170. MNa+ calculated for (C 9 H ]S 0 3 )Na and (CjHuC^Na: 195 and 193. 
Observed: 195 and 193. The triketides were purified via silica gel chromatography (1 .2 
x 10 cm, eluted isocratically with 50% ethyl acetate in hexanes). Inspection using ! H 
NMR indicated that the collected fractions were not. pure. Further purification was 
obtained by running a preparative TLC plate in 50% ethyl acetate in hexanes and using 
the 254 nm absorbance of the ketolactone as a marker. Rr estimates of the triketide 
lactone and ketolactone in the solvent system were 0.45 and 0.5, respectively. 

2.3.3. Production of triketide lactone by CH999/pST96, CH999/pST114 

S. coelicolor CH999/pST96 and CH999/pSTU4 produced triketide lactone 2 and 
triketide ketolactone 4 (ca. 5 mg/L), as determined by l H NMR spectroscopy (Figure 2.6, 
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Figure 2.7). Purification was carried out as for CH999/pST97. The products were 
identical to those extracted from CH999/pST142. 

2.3.4. Production of 6-deoxyerythronolide B by CH999/pST113 

S. coelicolor CH999/pSTl 13 produced 6-deoxyerythronolide B 1 (6-dEB) (ca. 2 mg/L), 
as determined by *H NMR spectroscopy (Figure 2,8 A). The product was also confirmed 
by mass spectrometry (ESI). MH+ calculated for (C 2 iH3 3 06)H+: 387. Observed: 387. 
6-dEB was purified using silica gel chromatography (1.2 x 10 cm, gradient of 25-50% 
ethyl acetate in hexanes). Observed Rf- 0.5 in 50°/o ethyl acetate in hexanes. 



r 



2.4. Discussion 

Hie ability of the fused modules to catalyze the biosynthesis of the expected polyketide 
products cogently demonstrates the feasibility of manipulating the organization of 
modules in PKSs. The robust productivity of these engineered PKSs, relative to domain 
substituted PKSs 15 ' 35 , also reflects the advantage of keeping modules as intact catalytic 
units. More importantly, however, these results highlight the crucial role of the linker 
regions located between modules. Prior to this discovery, the significance of the non- 
conserved regions containing the linkers had not been recognized, exemplified by the' 
introduction of mutations in earlier attempts of the M1-M3-TE and M1-M6-TE module 
fusions (Figure 2.9). Inspection of the aligned sequences indicates that the changes 
caused 

eryMl-M2 LAAHLAAELG GATGAEQAA? A. . .TT.AFV DEPIA'IVGM 

eryMl-*M3 LAAHLAAELG GATGAEQAA? A. ..TT.APA SEPIAIVSM 
eryMl-*M6 LAAHLAAELG GATGAEQAAP A-..TT.APA S2PIAIVGM 
eryMl-M3 LAAHLAAELG GATGAEQAA? A...TT.APV DEFIAIVSM 

eryMl-K5 LAAHLAAELG GATGAEQAAP A. . .TT.AFV DEPIAI VGM 

eryMl-M6 LAAHLAAELG GATGAEQAAP A. ..TT.AFV DSPIAIVGM 

eryMl-rifMS LAAHLAAELG GATGAEQAAP A...TT.APV DSPIAIVGM 

Figure 2.9 Alignment of heterologous intermodular fusions. Alignment 
of amino acids showing the mutations made in the inactive bimodular 
constructs (indicated with *) and the subsequent restoration in die 
active constructs. 
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by the introduction of a cloning site upstream from the M3 and M6 ICS domains 
deleteriously affected the channeling of intermediates between the modules. Reversing 
the VD to AS mutation and use of die conserved Bsa BI site established the connectivity 
needed for transfer, as reflected by the efficient production of the expected triketide 
products. Furthermore, the successful transplantation of rifM5 into the bimodular 
framework suggests that this fusion strategy may apply generally, especially given that 
the Bsa BI site is conserved in 4 of 6 and 8 of 10 modules in DEBS and rifamycin, 
respectively. 

The impact of the VD to AS alteration was greater dian anticipated, especially 
considering that M3 and M4 of DEBS both contain a serine at the second position and 
M5 and M<5 of DEBS both contain alanine at die first position. However, the identity of 
either amino acid does not appear to be as important as the presence of an acidic side 
chain. Analysis of die homologous amino acids for modules similarly attached to an 
upstream module, including M2,M4, and M6 of DEBS and modules from the rifamycin, 
epothilone, avermectin, oleandomycin, FK506, rapamycin, and pilcromycin polyketide 
synthases, indicates that in nearly every case, one of the two amino acids is aspartate or 
glutamate (Figure 2. 10). Although this trend certainly does not prove the necessity of 
the acidic group, it does suggest, in concert with the experimental results, that the 
composition of the intermodular linker region is not arbitrary and does contain important 

features, for chain transfer. 

Whereas the bimodular constructs required transfer across only one heterologous 
junction, the production of 6-dEB by pSTl 13 reflected the ability of rifMS to accept and 
donate intermediate chains across two heterologous junctions. Transfer across the second 
junction, between rifM5 and M3 of DEBS, represented a fundamentally different 
phenomenon from that observed in the bimodular constructs. The transfer in the 
bimodule fusions was expected to tolerate some degree of protein change since the 
covalent attachment ensured the close proximity of the modules, even if specific protein- 
protein interactions were mitigated. However, the interaction between rifM5 and M3 did 
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Epo6 

EpaS 

Pik5 

Rap 4 

Rif3 

Ave 6 

Fi.k2 

Ri Jf€ 

FK10 

RapI4 

Ery2 

01e2 

Rife 

Ery4 

RLfl 

Ave 2 

Ave 9 

Pi!<< 

01e< 

RaplO 



VAALSAHLAS HVVSTGDG2SARP?DTGNVAPMTKBVASLDE0GL?ALIDSSLAKAGKR 

VAALSGNLLCILFPNAGATHAPATEREKSFENDAAOLSAXRGMTDEQK^ 

PAALAAHLHEAYLApAEPAPT DTOGRVRRALAELPLDRLRDAGVLPTVLRLTGIEPEPGSGG 

PTALAARLDELF.AFAEPEPR LHEQELRRAIAGISIDXTREAGVLDTLLRLAAME G 

PEALVGYLRVSLLREADDGLD GREDDLRRVLAAVPFAREKEAGVLP7LtGLADTG7EPGTDA 

PAALAEHLGERLLPDQEASGEQAGDQLSGGSEEDVRSLLTSIPIGRLROAGI^LGrLLTLADTGRGASGAA 

PAALASHLDAE . LPRGAS DQDGAGNRKGNENG . TTAS RS TAET DALLAQLTRIiEGALVLTGLS D AP 

PLAVAaYLGAR.LV P DGT ANGNGNGNGKS E D DRLRH ALAAI AAE DAGEE R S I ADLGV D OL VQL A F 

PSALTAYLESQ.LGDGADDDAPTVLALLAEMD.SLDAADTAATFAF. . ERAAIADLLDKLSRTWKDHK— 
SAELAAHLDAL. L . . DSPI DAAGVYALLEEIN . ELDAEAVOMT . AA . . EHKAIS2LLEQLS AKWRTRNE 
ASAVAGFLDAE. LGTEVRGEAPSA. . .LAGLD. ALEGA. LPEVPA7 " 

ATAIARFLQSE . L V . GSCDPLTLMRSAI 

PLVLARHLRDE . L • G . AGDDALSVVKARL 

PLAVAEHLRDR.LFA. .ASPA ". VDIGORL 

PLALARHLRAE . LAVpSASPA DAVLAGL 

PTTLTKKLHTQ . L QPO PDKA. VAP VLAEL . 

PTTLTKKLHTQ. LVSXGWAAAE PDAA . TTFPGLPSLLSEL . 

PGELAGHLLDE . LATAAGGSWAEGTGSGDTAS . ATDRQT7AALAEL . 

PS ALADHLLEL . LAPATQPTAAP LLAEL . 

PADLAARLGDL . MNFRVQSTT LLAE I . 



EREELVQRLERMLAALRPVAQA 

.OQ. -I--E-GL A. . LLES 

. VEALL . . 
.LEKAL. . 
.LEAAI . . 
. LESAL. . 



. « ED 
-DE 
, .AG 
. . DK 

.ER. .LEAWLSSTTSS 

.DR. .LEGVLASLAPAA '. 

. ER. . VEQLLSAAASFGGPASAV 
.DR. . IEKMFTSVTFDDRQASAI 



G . . GLRL 
E. . ALSA 
A. . AAGA 
S. . ALDK 
. AAPL 



AC? 



Putative Coilcd-coll in cryM2 




Epc6 ■ . "~ 

Epc6 — - - — ~~ — 

PikS SDGGAADPGASPE. . , AS I D C L D AE AL I RM AL G P RNT — — ~ — 

Rap4 LAVPKPDSESDDE. . . AFVDEHDADALIKHVLEEER — -™ - 

Rif.2 ETTEAA. PAADDA. . .ELIDALDISGLVQRALGQT5- — ■ " 

Ave 6 AGPECAPPSGQDTPAPVSIDEKDIDDLCTIAKGHGTAPAREPADAEDSSSSRNRTHHTKSGETA** 

7LV.2 GSE£V1EHLRSLRSMV7GETGTGTASGAPDGAGSGAEDRPWAAGDGAGGGSEDGAG7PD7M\ T ASAEELFG 

Rife GC£ - " 

FK3.0 — " " " " 

Rapi* " *" ' , 

Ery2 ADA5GTGANPSGDDLGEAGVPELLEALGRELDGD - - — ~ 

01e2 DEEAPSEITKRLKILLPRFGSGG5SRGREAGC0AGEr:QDVEDAT:0ELFEVLDyE.LGWS 

Rife DESTKTGLTLRLCGLVARCXGWDQTGGETLADR LEAAS AOEVLDFIpGELGlT 

Ery< EDG.HDDVGCRLESLLRRWMSRRAOAP STSAI . . . SEDASOOELFSMLCQRFGGGEDL 

Rif7 PDG. .QRITARLRELLXAAEAAEA.RP GTS GO. . . LOTAS OE EL FALVDGLO 

Ave2 TDSASERVTLRLKSLMLRWNAPQHPTA ESADDDEKFTSATEAElFKFlpKDLGLS — 

Ave 9 DDGARTRLASRLHSLAQKLNGD pTAFD. . . LAETSOESMFALIpRSVGFSSQ ■ 

F<k4 . . GGRPEL.AARLRALAML G 03GPDAT3L0EASDD3LFSFIDKEIG35DF- 

Oi e4 DEfiTWLIATRLATLASQWTHLPVGSPGNA DNRS .GPGESGQAQE . . SGATGEKTAAWTSDDDLF 

RapIO [<D. RLSSVLNKWQR. . ISSPEEV STTALS5ASA5EILDFIDREFGDPTA — - 

Figure 2.1 1 Alignment of C-terminal intcrpolypcpude linkers. 
Alignment of regions following the ACP domain (indicated by label) of 
C-terminal modules in DEBS (Ery), rifamycin (RiO, epothilone (Epo), 
avermectin (Ave), oleandomycin (Ole), FK506 (FK), rapamycin (Rap), 
and pikromycin (Pile). Sequences are grouped according to the degree 
of homology, according to PileUp (Genetics Computer Group). The 
region predicted to contain a coiled-coil motif is also underscored 
(corresponding to the sequence of EryM2). The positively and 
negatively charged residues are in blue and red, respectively. 

appear to play a role in the protein-protein recognition for chain transfer between 

modules on separate polypeptides. Figure 2.11 shows the C-terminal linker region of M2 

along with the homologous region of modules from the above-mentioned PKSs. 

Whereas the intrapoiypeptide linkers demonstrated obvious homology, especially within 

.15 
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a given PKS (e.g. rapamycin), the C-terminal interpolypeptide linkers did not seem retain 
much homology. Comparison of the ordering of the sequences suggested that even 
linkers from the same PKS had few features in common. The diversity of these linkers 
may reflect their suspected ability to selectively interact with other polypeptides. 
Interestingly, their putative partners, the N-tenninal interpolypeptide linkers, contain 
relatively high homology in comparison. Across the different PKSs, the partem of 



2po7 "MTDREGQLLERLaEVTLALRKTLNERDTLS . . . t.eiCTEPIAI 

Epo9 MATT MAG KLEHALLLMDXLAKKNASLE. . .QERTEPIAI 

Rap5 - — MREDQLJjDAkRKSVXHWARLRKANTSLRAAMD . • • EPIAI 

Ri£4 MANQSWRK^SAPNEQIVQALRASLKEKVRLQGENSALAAAAA. . .SPVAI 

Pik6 KTSSECEQLVDALRAS^KENEELRKESRRRADRRQ. . .SPMAI 

Ave7 - ' -KDTSSEXLVDALRAS IjKAUQTLRARWEQLAAAMEASSEP I A - 

Rif 7 — MVSASYESCWEALRKS^aSVGTLKKRNRQLA. . . DAAGEprAI 

Ery3 * MTDSEKVAE^LRRATLOIjRAARQRIRELES DPIAI 

Ole3 — MTts'DEXlVSYLKRATVDLRKARHRIWELBD SPIAI 

Ery5 *-**MSGDNGMTE.EKLRRYIiKRTVTELDSVTARI,REVEHR, . .AGE?*** 

OleS ■ MAEAEXLREYLWRATTfSLKEVS'ORIiRETEER. . .AR& pIAI 

AvelO — **MA.NEEKLRDYLKRVTADLLNVRRRLQQIESG. . .EQ£ pIAI 

Avg2 MQLA. NEAKLLBYLKRVTADLDRTRRRLYEVVER. - .EQSPIAI 

PikS MAWNE DKLRD YLKRVTAELQQNT RRLRE I EGR - . >7BSP VAI 

Pik3 — MSTVNEEKYLDYLRRATADLHEARGRLRELEAK- . . AGEP^AI 

Rife I-ADEGQLRDYLKRAIADARDARTRIREVEEQ. . ,A&E pIAI 

FK7 — -MPEOOXTVEYLRWAOTELQRTRAEL AA. .HS.EP^AI 

Rap II ' K?£Q DKVVg YLRWATA^LHTTRAKLE ALAAA , .NT. E? LA I 

Rif 9 -MArDEKLLKYLKRVTAELHSLRKQ GAR.. . HADSPLAV 

Epo3 -MTTRGPTACQNPr.KQAAIIIQRiEERLAGLAQAELERTISI?IAI 



Putative coiled-coil 



Figure 2.12 Alignment of N-terminal interpolypeptide linkers. 
Alignment of regions preceding the KS domain (indicated by label) of 
N-terminal modules in DEBS (Ery), rifamycin (Rif), epothilone (Epo), 
avermectm (Ave), oleandomycin (Ole), FK506 (FK), rapamycin (Rap)- 
and pikromycin (Pik). Sequences are grouped according to the degree 
of homology, according to PitelJp (Genetics Computer Group). Th e 
region predicted to contain a coiled-coil motif is also underscored 
(corresponding to the sequence of EryM3). The positively and 
negatively charged residues are in blue and red, respectively, 

charged residues remained consistent. However, like the C-terminal linkers, the modules 
within a given PKS do not seem to cluster as closely as for "the intrapolypeptide linkers. 
The high proportion of charged residues on the interpolypeptide linkers implicate their 
exposure to solvent and may also serve as a means of specific recognition. This feature i< 
analyzed in greater detail in Chapters 4 and 5. 

The results of these in vivo experiments revealed the heretofore unrealized 
importance of the short, non-conserved linker regions. Although proper linker 
maintenance appears to facilitate functional reorganization of modules, the catalytic 

36 
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Zpoi-S 

Epo7-9 

Epo3-4 

Epo5-6 

Ave5-6 

Olel-2 

PK9-1Q 

Rapi3-14 

Ery3-4 

Ery5-6 

Avg3-4 

Avg4-S 

Rapll-12 

Rap9-10 

Rao2-3. 

Rap6-7 

Rapl-2 

Rap3-4 

RapG-9 

Rap'-5 

Rapl2-13 

Rap5-S 

FK7-6 

FK9-9 

Rif4-5 

Rif9-lC 

AvelO-11 

Avell-12 

Ave7-3 

Av a e-9 

Eryl-2 

Ave 1-2 

Fi!<3-4 

Rif2-3 

Rifl-2 

OLe3-4 

Pikl-2 

01e5-6 



FDKMVQRLVEHLLVDVL. 
FDKFTVERLVAHLLTOVL. 
FDHPTVERLVEYJiLSQAL . 
FDHFTVDALTRWLLDKVL . 
FDYPT PQECAAHLRTQLV . 



. KWDR • . S DX QKVRSLA5 OE P . I AI 

.KLEDa. .SDT RHIRSVAADDD. IAI 

.ELQDS. .TDV RSVRLPATEDP. IAI 

. AVAE^SVSPA KSSPQVALDEP . IAV 

. DLDDEEDAALS NALPQVAHRRTVEDEP . IAI . 

FDYARPAALAGHLRSRLI DODGE. . -H . . .GALEGV- . EKHAIDEP. IAI 

FSHPTAEALATHLLELL D.APAALTTGAPLPAVTAAPGTAARDQ. . .DEP. IAI 

FDYPT AN ALAAHLLGKL DIPPVQQRLEAPAPSTVTGPADPVADSPSANEF . IAI 

FDHPTVTALAQHLRARLV , , .G. . CADQ. . AAVRV . . . VGAA DESEP. IAI 

.GAAEAECAFALVRE. ..VP?K DADDP.IAI 

VGADTGTTTRLPLTAV PADEF . IAI 

A3FQPDNSTPLPRPRA , ELDER . IAI 

. TGEfcJPV FVRGPVSAV AQDE? . LAI 

. TGSNPA PVRGPVSAV AQDEP . LAI 

.TGEHPV. .... .PVRGPVSAV AQDIP.LAI 

.TGENPA PVREPVPAV AQDEP.LAI 

.TGEWPA PVRTSVSW AQDEP . IAI 

.TGEMPA FVKGPVSW GQDEP. LAI 

.TGEKPV QVRTPVSAV GQOEP . LAI 

.VGETPV PVRTSVSW AQDEP.LAI 



' FDHPTITRLADHYLERLV . 
FDYPTPMALCQFLRAAI. . 
FDYPSPTKLAQFLLSEI. . 
FDYPTPAALAARL . DELF . 
FDYPT PTALAARL, DELF , 
FDYPT PTALAVRL . GELF . 
FDYPTPXALAARX . DELF , 
FDYPTPAALAARL. SELF. 
FDYPT PTVLAARL . OSLF . 
FOYPTPGALAARL . EELF . 
FOYPTFTALAARL f GEWF . 
FDYFTPAILATRL.GELF. 
FDYPT PAVLAARL. GELF. 
FDFPTPRALAARLGDELA. 
FDHPTPRVLAAELRTDLF. 
FDYPT FVALARHLREEL. . 
FDYPT PVALARYLRDEL . . 



■ LAV 



. TGEMPA PVRPSVSVV GQDEP . 

. TGEUPV LVR . TASW GQCEP . LAI 

. GTRAPV AARTAATAA AH OEP . LAI 

, GTAAPL PARTART RHDE? . LAI 

. .GETVAGAFATPV TTVA DAGEP. IAI 

. .GDTVATTPVA.T AAAA DAGEP.IAI 

FDYPN PQAIARHLRDEL GG.AAQTPVTTA AAKA DLCEP, IAI 

FDYPTPSALAGYLKEQLE. . .EGA. . .GGQROIAPPVPASRV OVOEP. IAI 

. .GGSPTCGTALALALRAPAPRV DVDEF . IAI 

, . GSDSTASIF. . .APRAAAVPA DQDEP.VAI 

.GDDAEGETP. . . 5.QVALHQV AACEP.IAI 

.G. .ATGAEQ- - - AAFATIAf V DEP. IAI 

, .G TAAES AF S AAAVT AE A SVTEP.IAI 

. GEQAGAG2QLPV . DGGV DOEP . VAI 

, .GERPAA. . PAFV .TROV. . SDEF.IAI 

, .G. .SAANRPAEI.GTAA AEEP.IAI 

. GVSGAPADLS WRMADE EDDP.WI 

, . GDEE - ADARRSAALPATVGAGASAGAGTDADDD? . IAI 



fdhprpvajaahihsqts . . 
fdhpnpttlthklhtqll. , 
fdqpnaatiarhlrrelk. . 
fdhpdvrtlaahlaaelg. . 
fdkptpaklavhlqnqlr. . 
f dy pt prt lae fllae i l . , 
fdyphpavladhlraell . . 
fdypkp?aladhlra;<lf. . 
fdypttnglaeylrs5lf. . 
fdhptplalvsllrsefl. , 



FDHPNA?AIAR FLQSQLLPDAESSSAVPSSPEDEVRQALASLSLDQLKGAGLL3PUAL 
ACP KS 



Figure 2.10 Alignment of intrapolypeptide linkers. Alignment of 
regions linking covalently attached modules in DEBS (Ery), rifamycin 
(Kit), epothilone (Epo), avermectiti (Ave), oleandomycin (0!e), FK506 
(FK), rapamycin (Rap), and pikroraycin (Pik). Sequencer are grouped 
according to the degree of homology, according to PileUp (Genetics 
Computer Group). The intrapolypeptide linker sequences are flanked 
by the consensus ACP.and KS sequences, as indicated by the labels 
under the list. The positively and negatively charged residues are in 
blue and red, respectively. 

not contain such a crutch since they reside on different polypeptides. Therefore, the 
observed production of 6-dEB implied that some recognition of rifMS and M3 was 
retained. Because the C-tenninus, fused to rifMS, was the only feature remaining from 
M2, the ca. 90 amino acid linker region was implicated in the recognition. Thus, not only 
do the linkers appear essential for chain transfer between fused modules, but they also 
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